Thai WordNet Construction
نویسندگان
چکیده
This paper describes semi-automatic construction of Thai WordNet and the applied method for Asian wordNet. Based on the Princeton WordNet, we develop a method in generating a WordNet by using an existing bi-lingual dictionary. We align the PWN synset to a bilingual dictionary through the English equivalent and its part-of-speech (POS), automatically. Manual translation is also employed after the alignment. We also develop a webbased collaborative workbench, called KUI (Knowledge Unifying Initiator), for revising the result of synset assignment and provide a framework to create Asian WordNet via the linkage through PWN synset.
منابع مشابه
Building a Gold Standard for Thai WordNet
This paper presents a method of building a gold standard test set of Thai WordNet. The results of this research can be utilised for evaluating or comparing the results from different approaches of Thai WordNet construction. In this research, a part of Thai WordNet is carefully handcrafted from Common Base Concepts’ FirstOrderEntities with five translation resources. However, we found that to bu...
متن کاملAutomatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
متن کاملSemi-Automatic Semantic Relations Extraction from Thai Noun Phrases for Ontology Learning
The critical issue in ontology construction is to extract concepts and identify ontological relations both in taxonomic and other semantic relations. In large and various domains, this task can be time-consuming and costly. In this paper, we propose the methodology to discover semantic relations embedded in Thai NPs in order to enrich the existing domain ontologies by using machine learning tec...
متن کاملSemantic Similarity Measures for the Development of Thai Dialog System
Semantic similarity plays an important role in a number of applications including information extraction, information retrieval, document clustering and ontology learning. Most work has concentrated on English and other European languages. However, for the Thai language, there has been no research about word semantic similarity. This paper presents an experiment and benchmark data sets investig...
متن کاملParticipation in Language Resource Development and Sharing
Language resources are really much required for understanding and modeling the language in the present approaches. The language that has a rich language resource gains a big benefit in making a big advance in language processing. On the other hand, the less resource language is struggling with preparing a large enough language resource such as raw text or annotated corpora. It is a labor intens...
متن کامل